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Abstract 

We  study  competitive  on-line  scheduling  in  multi-processor  real-time  environments.  In  our 
model,  every  task  has  a  deadline  and  a  value  that  it  obtains  only  if  it  completes  by  its  deadline. 
A  task  can  be  assigned  to  any  processor,  all  of  which  are  equally  powerful.  The  problem  is  to 
design  an  on-line  scheduling  algorithm  (i.e.,the  scheduler  has  no  knowledge  of  a  task  until  it  is 
released)  with  worst  case  guarantees  as  to  the  total  value  obtained  by  the  system. 

We  study  systems  with  two  or  more  processors  and  with  uniform  or  non-uniform  value  den¬ 
sity.  We  present  an  inherent  limit  on  the  best  competitive  guarantee  that  any  on-line  parallel 
real-time  scheduler  can  give.  Then  we  present  a  competitive  algorithm  that  achieves  a  worst 
case  guarantee  which  is  only  within  a  factor  of  2  from  the  best  possible  guarantee  in  many  cases. 
These  are  the  most  general  results  yet  known  for  parallel  overloaded  real-time  scheduling. 


1  Introduction 

In  modern  life,  real-time  computer  systems  are  gaining  importance  at  a  rapid  pace.  Once  limited 
to  exotic  applications,  real-time  applications  now  can  be  found  in  many  civilian  and  military  prod¬ 
ucts.  These  range  from  multi-million  dollar  gadgets  like  (the  proposed)  space  stations  to  relatively 
mundane  products  like  cars  and  airplanes.  Real-time  systems  control  the  production  and  safety  in 
power  plants,  factories,  labs  and  perhaps  soon  in  our  home  alarm  systems. 

A  firm  overloaded  real-time  system  is  one  for  which  even  a  clairvoyant  scheduler  cannot  meet 
all  deadlines  with  the  available  computational  resources.  Overload  can  arise  either  as  the  result  of 
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failures  of  some  computational  resources  or  as  a  transient  condition  (e.g.,  in  telecommunications 
and  multimedia).  In  a  parallel  setting,  it  is  possible  to  tolerate  failures  and  still  meet  the  deadlines 
of  many  high-value  tasks.  To  do  this,  we  must  have  algorithms  that  can  adapt  to  the  number  of 
processors  available  and  give  performance  guarantees. 

Recently,  several  groups  of  researchers  (including  us)  have  presented  complexity  results  and  al¬ 
gorithms  that  give  guarantees  for  overloaded  real-time  systems.  The  current  paper  extends  our 
previous  uniprocessor  results  to  parallel  architectures.  We  discuss  both  uniform  shared  memory 
models  (where  thread  migration  is  cheap  and  scheduling  is  global)  and  non-uniform  shared  memory 
models  (where  thread  migration  is  impractical  but  scheduling  is  still  global).  By  “migration”  we 
mean  the  ability  to  move  a  thread  that  has  already  begun  execution  from  one  processor  to  an¬ 
other.  For  both  models,  we  assume  that  preemption  within  a  processor  takes  no  time  (a  reasonable 
assumption  for  real-time  kernels). 

In  our  work-flow  model,  the  scheduler  is  given  no  information  about  a  task  before  its  release  time. 
When  a  task  is  released,  its  value,  computation  time  and  deadline  are  known  precisely.  If  a  task 
completes  before  its  deadline,  then  the  system  acquires  its  value.  Otherwise,  the  system  acquires 
no  value  for  that  task.  This  is  known  as  a  firm  on-line  real-time  model.  The  goal  of  the  scheduler 
is  to  obtain  as  much  value  as  possible. 

The  value  density  of  a  task  is  its  value  divided  by  its  computation  time.  The  importance  ratio 
of  a  collection  of  tasks  is  the  ratio  of  the  largest  value  density  to  the  smallest  value  density.  For 
convenience,  we  normalize  the  smallest  value  density  to  be  1.  When  the  importance  ratio  is  1,  the 
collection  is  said  to  have  uniform  value  density,  i.e.,  a  task’s  value  equals  its  computation  time.  We 
will  denote  the  importance  ratio  of  a  collection  by  k. 

We  choose  to  quantify  the  performance  guarantee  of  an  on-line  scheduler  by  comparing  it  with  a 
clairvoyant  ( also  called  off-line)  scheduling  algorithm.  A  clairvoyant  scheduler  has  complete  a  priori 
knowledge  of  all  the  parameters  of  all  the  tasks.  A  clairvoyant  scheduler  can  choose  a  “scheduling 
sequence”  that  will  obtain  the  maximum  possible  value  achievable  by  any  scheduler  1. 

As  in  [2,6,11]  we  say  that  an  on-line  algorithm  has  a  competitive  factor  r ,  0  <  r  <  1,  if  and 
only  if  it  is  guaranteed  to  achieve  a  cumulative  value  of  at  least  r  times  the  cumulative  value 
achievable  by  a  clairvoyant  algorithm  on  any  set  of  tasks.  For  convenience  of  notation,  we  use 
competitive  multiplier  as  the  figure  of  merit.  The  competitive  multiplier  is  defined  to  be  “one  over 
the  competitive  factor”.  The  smaller  the  competitive  multiplier  is  the  better  the  guarantee  is.  Our 
goal  is  to  devise  on-line  algorithms  with  worst  case  performance  guarantee. 

For  uniprocessor  environments  with  an  importance  ratio  k ,  Baruah  et.  al.  [2,3]  showed  a  lower 
bound  of  (1  +  \/k)2.  Wang  and  Mao  [12]  first  reported  an  algorithm  that  achieves  this  bound  when 
A'  is  1.  Having  independently  developed  an  algorithm  for  the  k  =  1  case,  we  were  able  to  generalize 
this  to  an  algorithm  called  J)°ver  that  meets  the  Baruah  et  al.  bound  for  all  k  [7]. 

For  multi  processor  environments,  Mok  and  Dertouzos  [4]  showed  that  no  optimal  algorithm  ex¬ 
ists  even  when  the  system  is  underloaded.  Locke  ([9]  pp.  124-134)  presented  heuristics  for  this  case. 

1  Finding  the  maximum  achievable  value  for  such  a  scheduler,  even  in  the  uniprocessor  case,  is  reducible  from  the 
knapsack  problem  [5];  hence  is  NP-hard. 
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Zhou  et.  al.  presented  an  on-line  algorithm  [13]2  for  distributed  real-time  systems.  Their  model 
resembles  ours  but  our  goal  is  to  give  worst  case  guarantees  for  value  obtained  (even  for  overloaded 
systems)  while  their  goal  is  to  generate  a  schedule  efficiently  when  the  system  is  underloaded  (i.e, 
all  tasks  can  be  scheduled). 

Wang  and  Mao  [3,12]  showed  a  lower  bound  of  2  (on  the  competitive  multiplier)  and  presented  an 
algorithm  that  achieved  this  bound  for  an  arbitrary  even  number  of  processors,  assuming  uniform 
value  density  and  no  slack  time. 

We  present  here  algorithms  and  complexity  results  for  tasks  having  slack  time,  executing  on 
an  arbitrary  number  of  processors,  and  with  arbitrary  importance  ratios.  The  gap  between  the 
performance  guarantees  of  our  algorithms  and  the  complexity  results  shows  that  there  is  still  work 
to  be  done.  Our  adversary  arguments  and  algorithms  offer,  however,  three  useful  insights: 

1.  A  parallel  on-line  scheduling  algorithm  improves  its  competitive  multiplier  by  devoting  some 
of  its  processing  resources  to  every  possible  value  density. 

2.  The  lower  bound  on  the  best  possible  competitive  multiplier  (as  measured  by  our  adversary 
arguments)  converges  to  In  k  as  the  number  of  processors  approaches  infinity.  Our  current  al¬ 
gorithm  gives  a  guarantee  that  converges  to  2  In  k  +  3  as  the  number  of  processors  approaches 
infinity. 

3.  It  should  be  easier  to  design  a  competitive  on-line  scheduler  when  tasks  are  assumed  to  have 
no  slack  time.  However,  in  all  the  multi-processors  scheduling  algorithms  that  we  studied, 
the  introduction  of  slack  time  entails  only  a  fixed  deterioration  of  the  competitive  multiplier. 
The  same  fixed  deterioration  is  observed  when  comparing  systems  with  cheap  migration  vs. 
systems  where  migration  is  impossible. 

This  paper  is  organized  as  follows:  Section  2  summarize  the  main  results.  In  section  3,  we 
present  our  complexity  results  using  a  novel  adversary  argument.  From  section  4  onwards,  we 
describe  our  algorithmic  results.  Section  4.1  states  the  Lost  Value  Lemma  that  we  use  frequently 
to  relate  results  in  the  case  where  migration  is  both  permitted  and  cheap  to  the  case  where  it  is 
not  possible.  In  sections  5  and  6  we  describe  and  analyze  the  Cascading  Multi-Band  Algorithm. 
Our  results  for  two-processors  systems  are  presented  in  appendix  A.  The  paper  ends  with  a  brief 
conclusion  and  a  discussion  of  some  salient  open  problems. 

2  Summary  of  Results 

We  present  algorithms  and  complexity  results  for  multiprocessor  scheduling  of  overloaded  real-time 
systems. 

1.  For  a  system  with  n  processors  and  maximal  value  density  of  k  >  1,  there  is  no  on-line 
scheduling  algorithm  with  competitive  multiplier  smaller  than  pzr[yn(&"  —  1). 

When  n  tends  to  infinity  this  lower  bound  tends  to  In  k. 

2  And  additional  references  within. 
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2.  We  present  an  algorithm  called  the  Cascading  Multi-Band  Algorithm.  For  a  system  with  2 n 
processors  and  importance  ratio  of  k  >  1,  this  algorithm  has  an  algorithmic  guarantee  of  at 
most 


1  +  2  n  min 

(0<U!  <n;n=U>  +ll>) 


<  max 

1  <  *  <  V’ 


ftf 


When  n  tends  to  infinity  this  bound  is  at  most  2  In  ft  +  3,  which  is  within  a  multiplicative 
factor  of  2  from  the  complexity  bound  for  the  same  system. 

3.  We  present  an  algorithm  called  the  Safe-Risky  algorithm,  for  two-processor  systems  with 
uniform  value  density  that  achieves  the  best  possible  competitive  multiplier  of  2  even  when 
tasks  may  have  slack  time  but  migration  is  allowed  3.  For  the  “No- Migration”  model,  a 
variant  of  this  algorithm,  called  the  Safe-Risky-(fixed),  achieves  a  competitive  multiplier  of 
3. 


3  The  Complexity  Bounds 

We  would  first  like  to  show  that  no  on-line  algorithm  can  have  a  competitive  multiplier  smaller 
than  jjzjj n ( ft ^  ~  1)-  As  usual  in  proofs  of  this  kind,  we  assume  that  an  adversary  can  choose  a 
set  of  tasks.  The  set  depends  upon  the  on-line  algorithm’s  behavior. 

We  consider  n  + 1  possible  levels  of  value  density  1 .  ft  <• .  ft  <* ,  •  •  • .  ft  »  =ft,  call  them  levels  0, 1,  •  ■  ■ ,  n. 

With  each  level  we  associate  a  period.  A  task  of  some  level  of  value  density  will  have  a  com¬ 
putation  time  and  deadline  equal  to  the  corresponding  period.  Hence,  the  value  of  a  task  of  level 
i  equals  the  length  of  the  «’th  period  times  the  *’ th  value  density.  The  length  of  the  O’th  level’s 
period  is  set  to  1.  We  choose  all  other  periods  in  such  a  way  that  the  value  of  an  i  +  1  ’th  level 
task  is  only  a  small  fraction  of  the  i’th  level  task’s  value.  In  fact,  we  choose  it  so  the  i  +  1  ’th  task’s 
“effective  value  density”  taken  over  the  i’th  period  is  arbitrarily  small  (say  e  for  some  small  positive 

<0- 

A  collection  of  tasks  that  has  n  identical  tasks  for  each  level  is  called  a  complete  set.  In  this  game, 
the  adversary  release  tasks  only  in  complete  sets.  The  behavior  of  the  on-line  scheduler  dictates 
when  the  next  complete  set  is  to  be  released. 

Let  1 1  be  the  time  when  the  Z ’th  set  was  released  (hence,  always  1. 1  =  0).  At  time  ti  (in  particular 
at  time  0),  the  on-line  algorithm  has  to  schedule  a  complete  set.  The  number  of  possible  scheduling 
decisions  is  vast.  However,  since  the  number  of  processors  is  smaller  than  the  number  of  levels, 

3This  was  already  known  when  tasks  have  no  slack-time  [3,12]. 
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at  least  one  level  is  not  chosen  by  the  on-line  scheduler.  Let  i0  be  an  index  of  some  level  (to  be 
specified  later)  that  is  not  chosen.  Then,  fj+i  is  set  to  be  the  end  of  the  current  *o’th  level  period 
(this  means  that  up  to  that  time  there  will  be  no  new  task  releases).  We  will  say  that  ti  is  associated 
with  level  i0. 

Suppose  that  //  is  associated  with  level  i0,  then  what  can  the  clairvoyant  scheduler  do?  It  can 
execute  n  tasks  of  level  i0  to  completion  between  /;  to  f/+1.  In  this  scheme,  the  clairvoyant  scheduler 
schedules  all  the  processors  in  the  same  way,  no  processor  is  ever  idle  and  all  current  tasks  complete 
immediately  before  a  new  set  is  released. 

At  least  one  but  possibly  several  value  densities  could  be  missing  from  the  on-line  schedule.  We 
will  start  by  proving  results  assuming  that  only  one  level  is  missing.  Later,  we  will  produce  a 
general  scheme  to  choose  i0  among  all  absent  density  levels. 

Lemma  3.1  Suppose  the  on-line  scheduler  gets  partial  value  for  tasks  that  were  executed  but  not 
to  completion.  In  particular,  for  a  task  that  executed  for  a  (0  <  a  <  1)  of  its  computation  time  the 
scheduler  will  get  a  times  the  task’s  value. 

If  only  one  density  level  is  missing  from  the  schedule  at  time  t[,  then  the  effective  value  density 
(i.e,  the  value  achieved  divided  by  the  period’s  length)  obtained  by  the  clairvoyant  scheduler  between 
ti  and  ti+i  is  at  least  jj^^n{k~  —  1)  times  bigger  than  the  effective  value  density  obtained  by  the 
on-line  scheduler  for  the  same  period. 

PROOF. 

Let  io  be  the  level  associated  with  the  period  in  question.  The  clairvoyant  algorithm  schedules  to 

*0. 

completion  n  tasks  of  level  «o  and  gets  an  effective  value  density  of  k  «  on  all  processors. 

We  know  that  during  this  period  the  on-line  scheduler  did  not  execute  any  task  of  level  i0.  In¬ 
stead,  it  scheduled  tasks  of  lower  and  higher  levels.  The  effective  value  density  of  any  tasks  of 
higher  level  is  much  smaller  than  its  value  density  because  of  their  short  period.  In  fact  all  these 
tasks  have  effective  value  density  of  at  most  e.  Hence,  the  effective  value  density  achieved  by  the 
on-line  scheduler  is  at  most 


,  ,1,2.  ,  »0~1 

1+A;  *'  +  &«+■••  +  &  »  +  e  +  e  +  e 

' - V - ' 

n—io  times 

l  2  *o— i 

—  n  (ti  — 


We  are  looking  for  the  smallest  possible  ratio  between  the  effective  value  densities  of  the  clair¬ 
voyant  and  the  on-line  scheduler.  That  is, 


min 

0<«o 


nk~ 


lQ—1 


&  «  +  (n  —  i0)e) 
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The  above  term  monotonically  decreases  when  i0  increases,  hence  the  minimum  is  obtained  when 
io  =  n  and  its  value  is 


nk 


1  +  k  n  -)-  k  n  +  •  ■  ■  + 


The  sum  of  the  geometric  sequence  in  the  denominator  is 

is, 


\  1  hence  the  smallest  possible  ratio 

kn-l 


and  the  lemma  is  proved.  □ 


k 

k  -  1 


1) 


The  preceding  lemma  dealt  with  the  special  case  that  only  one  value  density  level  is  missing  from 
the  on-line  schedule.  But  what  will  happen  if  more  than  one  level  is  missing?  In  appendix  B  we 
show  that  this  can  not  benefit  the  on-line  scheduler  (for  a  “good”  choice  of  io).  Hence,  we  get  the 
following  theorem. 


Theorem  3.2  For  a  system  with  n  processors  and  maximal  value  density  of  k,  there  is  no  on-line 
scheduling  algorithm  with  competitive  multiplier  smaller  than  ^  n{k~  —  1). 

PROOF. 

Fix  an  on-line  scheduling  algorithm.  Denote  by  V(t{)  the  value  obtained  by  the  on-line  scheduler 
until  time  t[.  Lemma  B.3  shows  that  the  value  obtained  by  the  clairvoyant  algorithm  is  at  least 

(S=iJ »(*"  ~  1  )V{M)- 

Note  that  /;  tends  to  infinity  as  l  goes  to  infinity.  If  V(ti)  does  not  tend  to  infinity  as  l  goes 
to  infinity  then  the  competitive  multiplier  of  the  on-line  algorithm  is  not  bounded  (because  the 
clairvoyant  algorithm  gets  a  value  of  at  least  nti  — *  oc).  Otherwise,  V(ti)  tends  to  infinity,  hence 
for  arbitrarily  small  e  >  0  there  is  a  big  enough  l0  such  that 

V(ti0)  >  \ kn  =>•  kn  <  eV(ti0 ) 

Suppose  the  game  ends  at  f/0  ( i.e. ,  no  more  task  releases).  The  total  value  obtained  by  the 
on-line  scheduler  is  not  greater  than  V(ti0 )  +  kn  (because  all  the  tasks  not  yet  completed  has 
length  at  most  1  and  value  density  at  most  k).  The  clairvoyant  scheduler  gets  a  value  of  at  least 
Jk=F)n(kn  -  l)V(ti0). 

Hence, 


> 


value  obtained  by  the  clairvoyant  scheduler 
value  obtained  by  the  on-line  scheduler 

firry n(kn  -  1  )V(ti0) 

V(ti0 )  +  kn 
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urnn^k'‘  -  mhc) 

~  V{tlo)  +  eV{tlo) 

>  (ir 

1  +  6 

This  holds  for  every  positive  t  hence  the  Theorem  is  proved. 

□ 

Corollary  3.3  As  the  number  of  processors  n  tends  to  infinity,  no  on-line  algorithm  can  have  a 
competitive  multiplier  smaller  than  In  k  (natural  logarithm). 


4  Algorithmic  Guarantees 

We  consider  two  possible  models  of  multiprocessor  systems.  In  the  first  model,  tasks  can  migrate 
cheaply  (and  quickly)  from  one  processor  to  another.  Hence,  if  a  task  started  to  execute  on  one 
processor  it  can  later  continue  on  any  other  processor  (and  migration  takes  no  time).  In  the  second 
model  (the  fixed  model),  once  a  task  starts  to  execute  on  one  processor  it  can  not  execute  on  any 
other  processor.  An  on-line  scheduler  can  do  better  when  migration  is  possible. 

4.1  The  Lost  Value  Lemma 

Let  A  be  an  on-line  scheduler  and  1  a  set  of  tasks  to  be  scheduled.  We  can  partition  the  tasks  of 
T  according  to  the  behavior  of  A 

1.  Tasks  that  never  completed  (F),  the  “lost”  ones. 

2.  Tasks  that  completed  successfully  (5) 


r  =  F  U  S 

Denote  by  V(T)  and  C(T)  the  value  achieved  by  A  and  the  clairvoyant  scheduler  from  the  tasks 

of  r. 

Lemma  4.1  The  Lost  Value  Lemma 

If  for  some  multiplier  c  and  every  set  of  tasks  T, 

C(F)  <  cV(T) 

Then, 
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C(r)  <  (c  +  i)F(r) 


PROOF. 


C(r)  =  C{F  U5)<  C(F )  +  C{S) 

=  C(F)  +  F(S)  =  C(F)  +  F(r) 

<  (c  +  i)v(r) 


□ 

5  The  Cascading  Multi-Band  Algorithm 

Suppose  a  system  lias  2 n  processors.  We  break  the  processors  into  two  disjoint  groups:  2?/;  proces¬ 
sors  will  constitute  a  “band  structure”  and  the  other  2u>  processors  will  constitute  a  “central  pool” 
as  described  below  [n  —  tp  ■ f  w;  and  n  >  u  >  0). 

J_  J_  2_  1p-l 

We  consider  V’  intervals  ( levels )  of  value  density  [l..k  '*/’  ),  [k  ..&  ^  ),■■■,  [&  ..A:],  call  these  levels 

1,  ■  ■  ■ ,  ip  respectively.  The  *’th  band  is  said  to  be  “lower”  than  the  i  +  l’st  band. 

Suppose  the  entire  set  of  tasks  to  be  scheduled  is  T.  We  partition  this  set  according  to  the  value 
density  of  the  tasks:  T  =  Id  U  1?2  •  ■  •  U  I,/.  where  Tj  contains  all  tasks  with  value  density  in  the 

i  —  1  i 

range 

We  allocate  2  processors  ( a  band )  for  each  of  the  value  density  levels.  In  addition,  the  remaining 
2 w  processors  are  allocated  as  a  central  pool ,  that  will  be  used  by  tasks  of  all  levels. 

The  main  idea  of  the  algorithm  is  to  assign  a  task,  upon  its  release,  to  the  band  corresponding 
to  its  value  density.  However,  if  the  band  is  already  overloaded  and  can  not  accommodate  the  new 
task  this  task  will  “try”  the  band  one  below.  If  the  band  below  can  not  accept  the  new  task,  the 
task  will  continue  to  cascade  downwards.  If  a  task  cascades  to  the  lowest  band  but  still  can  not  be 
scheduled  there  it  can  go  into  the  central  pool. 

If  a  newly  released  task  is  accepted  by  one  of  the  bands  or  by  the  central  pool  it  is  guaranteed 
to  complete  before  its  deadline  (these  tasks  are  called  “privileged”).  If  it  is  not,  it  awaits  its  LST0 
( Latest  Start  Time),  at  which  time  it  tries  again  to  be  scheduled  (details  to  follow).  The  algorithm 
has  three  major  components: 

1.  Upon  task  release,  assign  a  task  to  a  band  (possibly  after  cascading). 

V1-* 

4A11  but  the  last  interval  is  half  open  half  closed.  The  last  level  corresponds  to  the  closed  interval  [A  ^  ,  k], 

5 Definition:  LST  =  (deadline  -  remaining  computation  time).  If  a  task  is  not  scheduled  at  its  LST,  it  will  not 
complete. 


2.  At  LST  (of  a  non-privileged  task),  decide  whether  and  where  a  task  should  be  scheduled  or 
maybe  abandoned. 

3.  The  method  used  in  scheduling  each  band  (and  the  central  pool). 

Different  choices  for  these  three  components  would  create  different  variants  of  the  algorithm.  In 
this  paper  we  describe  one  specific  variant  that  we  call  the  Cascading  Multi-Band  Algorithm.  In 
this  variant,  the  central  pool  is  also  broken  into  bands  of  two  processors  each6.  All  the  pairs  (i.e, 
bands)  execute  the  same  two  processor  scheduling  algorithm. 

At  each  moment,  every  band  has  one  of  its  processors  designated  as  the  Safe  Processor  (SP)  and 
the  other  as  the  Risky  Processor  (RP).  Each  band  has  its  own  queue  called  Q-privileged ,  the  tasks 
in  Q -privileged  are  guaranteed  to  complete.  In  addition  to  the  local  Q-privileged  queues  there  is 
one  global  queue  called  Q -waiting .  This  queue  includes  all  the  ready  tasks  that  are  not  privileged. 

When  a  new  task  T  is  released,  it  is  assigned  to  a  band  as  follows: 

1.  It  is  added  to  the  Q -privileged  of  its  own  band  if  this  does  not  create  overload  (i.e,  all  tasks 
including  the  new  task  can  complete  on  SP).  Otherwise,  T  cascades  downward  as  described 
above. 

2.  If  T  was  not  accepted  by  any  band  (including  all  the  bands  in  the  central  pool)  it  enters  the 
Q-waiting  where  it  waits  until  its  LST  occurs. 

So,  at  release  time  only  the  SPs  are  examined.  A  task  might  not  be  scheduled  even  if  an  RP  is 
idle.  A  task  T  that  reached  its  LST  is  assigned  to  a  processor  as  follows: 

1.  If  there  is  any  idle  RP  among  all  the  lower  level  bands  (including  2  "s  own  level)  then  schedule 
T  on  one  of  these  processors1 . 

2.  If  there  is  no  idle  RP  among  lower  level  bands,  we  might  abandon  a  task  executing  on  one  of 
these  RPs  in  order  to  schedule  T,  depending  on  the  following  rule: 

Let  T*  be  the  task  with  earliest  deadline  among  all  the  tasks  executing  on  these  RPs. 
Abandon  the  task  with  earlier  deadline  among  T  and  T*. 

If,  at  task  completion  event,  SP  of  a  band  becomes  idle  while  RP  is  not  then  the  two  processors 
should  switch  roles;  the  safe-processor  becomes  the  risky-processor  and  vice  versa.  This  does  not 
require  task  migration. 

Using  idle  RPs  and  scheduling  tasks  of  Q-waiting  before  they  reach  their  LST  are  heuristics  for 
improving  the  average  case  behavior  of  the  scheduler. 

The  bands  structure  as  described  above  prioritize  high  value  density  tasks  over  low  value  density 
tasks.  Higher  value  density  tasks  start  their  cascading  at  a  higher  point  and  cascading  is  possible 
in  only  one  direction  -  downwards8.  However,  an  algorithm  that  uses  the  “pure”  bands  structure 

6 The  bands  of  the  central  pool  are  ordered  so  that  a  task  that  reaches  the  pool  start  with  the  first  band  in  the 
pool  and  if  not  accepted  it  cascades  to  the  second  band  and  onwards.  If  the  task  is  not  accepted  by  the  last  band  in 
the  pool  (“end-of-the-world”)  it  enters  Q -Waiting. 

'Heuristics  can  be  used  to  choose  the  processor  in  case  that  there  are  more  than  one  idle  RP.  Examples  might  be 
the  one  of  the  lowest  band,  or  maybe  the  highest. 

8  Hence,  higher  value  density  tasks  have  more  bands  that  can  possibly  accommodate  them. 
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(i.e. ,  with  no  central  pool)  can  be  crippled  when  the  task  set  consists  of  mostly  low  value  density 
tasks  since  all  the  higher  bands  will  be  left  idle.  In  order  to  minimize  the  loss  of  such  cases  we  add 
the  central  pool  to  the  bands  structure.  If  all  the  tasks  are  of  low  value  density  then  all  high  bands 
would  still  be  left  idle  but  the  bands  in  the  central  pool  would  be  utilized. 

A  big  enough  central  pool  will  offset  the  damage  caused  by  higher  idle  bands.  However,  mak¬ 
ing  the  central  pool  too  big  can  cause  another  problem — weakening  the  advantages  of  the  higher 
value  density  tasks.  We  conclude  that  choosing  the  right  size  of  the  central  pool  is  a  delicate  and 
important  aspect  of  the  the  Cascading  Multi-Band  Algorithm. 

The  following  is  a  small  example  of  the  Cascading  Multi-Band  Algorithm's  scheduling. 

Example  5.1 

Assume  that  the  highest  possible  value  density  is  16  (k  =  16),  number  of  processors  is  6  from 
which  2  are  allocated  as  a  central  pool  and  the  rest  constitute  2  bands.  The  first  band  will  be  for 
tasks  with  value  density  below  4  and  the  second  for  tasks  with  value  density  of  4  and  above.  For 
this  example,  consider  the  tasks  depicted  in  the  following  table: 


Task 

Release  Time 

Computation  Time 

Slack  Time 

Deadline 

Value  Density 

Ti 

0 

5 

0 

5 

1 

t2 

0 

5 

0 

5 

2 

t3 

1 

5 

0 

6 

3 

t4 

1 

5 

0 

6 

3 

t5 

1 

1 

0 

2 

3 

Tq 

2 

4 

2 

8 

16 

t7 

2 

3 

2 

7 

10 

T8 

3 

2 

1 

6 

10 

t9 

3 

2 

0 

5 

10 

Tj0 

6 

1 

1 

8 

16 

The  first  two  tasks  to  be  released  are  scheduled  on  the  SP  of  the  first  band  and  the  central  pool 
[T2  cascades  into  the  central  pool).  When  X3  is  released  it  can  not  be  scheduled  on  an  SP,  so  it  is 
inserted  into  Q_waiting  only  to  create  an  LST  interrupt  immediately.  Then,  it  is  scheduled  on  the 
RP  of  the  first  band.  In  the  same  way  T4  is  scheduled  on  the  RP  of  the  central  pool.  But  when  X5 
arrives  it  can  be  scheduled  neither  on  any  of  the  SPs  nor  on  any  of  the  RPs,  hence  is  abandoned 
(in  the  LST  routine).  Note  that  T5  is  abandoned  even  though  the  second  band  is  idle  (a  task  can 
cascade  only  downwards). 

All  the  remaining  tasks  have  value  density  high  enough  to  be  scheduled  on  the  second  band.  Tq  is 
scheduled  on  the  SP.  T7  can  not  be  scheduled  on  any  of  the  SPs  and  it  enters  Q_waiting  (with  LST 
at  4).  Tg  can  be  added  to  the  SP  of  the  second  band  preempting  Tq  (which  has  a  latter  deadline). 
T9  can  not  be  scheduled  on  any  of  the  SPs;  it  reaches  its  LST  and  is  scheduled  on  the  RP  of  the 
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Central  Pool 


Figure  1:  The  Cascading  Multi-Band  Algorithm  Scheduling 
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second  band,  but  at  time  4  it  is  abandoned  in  favor  of  T7  which  arrived  to  its  LST  and  has  a  later 
deadline. 

At  time  5,  the  SP  of  the  first  band  becomes  idle,  which  creates  a  switch  of  the  roles  between  the 
SP  and  RP  of  that  band.  Later  at  time  6,  T10  is  released;  it  can  not  be  scheduled  on  its  own  band’s 
SP  but  after  cascading  it  is  scheduled  on  the  (new)  SP  of  the  first  band. 

All  in  all,  the  Cascading  Multi-Band  Algorithm  completed  all  the  tasks  but  T5  and  Tg.  A  clair¬ 
voyant  scheduler  could  schedule  all  the  tasks  ( T5  can  be  scheduled  on  the  idle  SP  and  Tg  can  be 
scheduled  before  its  LST  on  the  same  processor).  Figure  1  shows  the  schedule  created  by  the 
Cascading  Multi-Band  Algorithm. 


6  The  Algorithm’s  Competitive  Multiplier 

In  this  section  we  would  like  to  study  the  behavior  of  the  Cascading  Multi-Band  Algorithm  and  to 
compute  its  competitive  multiplier.  The  final  result  is  stated  in  theorem  6.2. 

Recall  that  1  is  the  entire  set  of  tasks  to  be  scheduled.  We  partition  the  tasks  of  1  according  to 
the  behavior  of  the  Cascading  Multi-Band  Algorithm :  tasks  that  never  completed  ( F )  and  tasks 
that  completed  successfully  (S).  Denote  by  Lr(r)  and  C(r)  the  value  achieved  by  the  Cascading 
Multi-Band  Algorithm  and  the  clairvoyant  scheduler  from  scheduling  I  ,  respectively.  We  will  show 
that  C(F )  <  aC(r)  (for  some  constant  alpha),  using  the  Lost  Value  Lemma  (4.1)  we  will  get  a 
competitive  multiplier  of  a  -(-  1. 

Before  we  start  we  must  introduce  some  notation  and  definitions. 

Definition  6.1 


•  Productive  Band:  A  band  is  said  to  be  productive  at  time  t  if  at  that  time,  its  SP  is  not 
idle. 

•  Executable  Period:  The  executable  period ,  of  a  task  is  the  interval  between  its  release 
time  and  its  deadline. 

By  definition,  a  task  may  be  scheduled  only  during  its  executable  period. 

•  Cumulative  Value  Density  (CVD):  Suppose  some  schedule  is  chosen,  the  cumulative 
value  density  at  time  t  is  the  sum  of  the  value  densities  of  all  tasks  executing  at  time  t.  9 

□ 

Lemma  6.1  If  at  time  I  a  task  with  deadline  d  is  executing  on  RP  of  a  band  (i.e  this  task  was 
scheduled  by  an  LST  interrupt)  then  that  band  will  be  productive  between  t  and  d. 

9 For  example,  for  a  system  with  2 n  processors,  if  all  processors  are  idle  at  time  t  then  CVD(t)  =  0.  If  half  of 
the  processors  (i.e,  n)  execute  tasks  with  unit  value  density  and  the  others  execute  tasks  of  value  density  k  then  the 
cumulative  value  density  is  n  +  kn.  In  no  case  can  C'VD(t)  be  bigger  than  2 nk. 
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PROOF. 


If  SP  does  not  become  idle  before  time  d  then  by  definition  the  band  is  productive  between  t  and 
d.  Otherwise,  suppose  SP  becomes  idle  at  time  5,  t  <  s  <  d,  then  there  must  be  a  task  executing 
on  RP  at  time  s  (because  a  task  on  RP  can  be  abandoned  only  in  favor  of  another  task  with  a  later 
deadline  and  no  slack  time).  So,  at  time  s  RP  becomes  SP  and  it  would  not  become  idle  before 
time  d  because  the  deadline  of  the  current  task  is  at  least  d  (and  it  has  no  slack  time). 

□ 

Here  are  a  few  things  to  notice  about  the  Cascading  Multi-Band  Algorithm: 

1.  At  any  band  i,  only  tasks  of  level  i  or  higher  can  be  executed. 

2.  If  a  task  T  of  level  i  is  abandoned  then  band  i  and  all  lower  bands  (including  the  central 
pool)  are  productive  during  the  entire  executable  period  of  T . 

PROOF. 

Let  T  be  T(  r,  c,  d)10.  Upon  T’s  release  it  was  not  accepted  by  any  of  the  levels  on  or  below 
i.  This  means  that  for  each  of  these  bands,  the  tasks  currently  in  (the  local)  Q_privileged 
will  execute  at  least  until  d—  c  (otherwise  T  could  become  privileged).  This  proved  that  all 
bands  are  productive  between  r  and  d  —  c. 

However,  d  —  c  is  the  LST  of  T .  At  its  LST,  T  would  not  be  scheduled  only  if  every  band  (on 
or  below  the  Pth)  has  a  task  currently  executing  on  its  RP  with  deadline  after  d.  This  means 
that  all  bands  are  productive  between  d  —  c  and  d  (lemma  6.1)  Combining  the  two  gives  the 
desired  result. 

□ 

3.  Once  a  task  starts  to  execute  on  some  processor,  it  will  never  migrate  to  another  processor. 

At  any  given  time  t,  consider  all  the  tasks  of  F  for  which  t  is  in  their  executable  period.  Let 
high(t)  be  the  value  density  level  corresponding  to  the  task  with  the  highest  value  density  among 
all  these  tasks. 

Suppose  the  clairvoyant  scheduler  has  to  schedule  only  the  tasks  of  F,  and  suppose  it  had  chosen 
some  optimal  schedule  for  these  tasks.  At  time  l.  the  best  the  clairvoyant  scheduler  can  hope  for 
(looking  only  at  time  t)  is  to  have  all  2 n  processors  executing  tasks  of  level  high(t),  i.e,  with  value 

high(t ) 

density  not  greater  than  k  V’  .  We  conclude  that  the  cumulative  value  density  of  the  clairvoyant 

high(t) 

schedule  at  time  t  is  bounded  by  2  nk  V’  . 

10I.e.,  released  at  time  r  with  deadline  d  and  computation  time  c. 
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The  facts  that  a  task  of  level  high(t)  was  abandoned  and  that  t  is  in  its  executable  interval  imply 
that  at  time  t,  all  bands  up  to  (and  including)  high(t)  were  productive.  This  means  that  the  on-line 
scheduler  has  a  cumulative  value  density  of  at  least11: 


1  2  high{t)  —  1 

io  +  i  +  k^'  +ki'  +...  +  *  i' 


=  U)  + 


high(t ) 

(k  V>  -  1) 


_L_ 

0  -  1) 


This  leads  to  the  following  theorem. 


Theorem  6.2  For  a  system  with  2 n  processors  and  maximal  value  density  of  k  >  1  the  Cascading 
Multi-Band  Algorithm  has  a  competitive  multiplier  of  at  most 


1  +  2  n  min 

(0<1p<n~n=U!+ip) 


<  max 

i 


ki' 


U!  + 


(k^-1) 
(Jfc^.- 1) 


> 


(1) 


PROOF. 


The  discussion  above  demonstrated  that. 


C(F) 

-  -  —  <  max 
V  (T)  i  <i<ip 


2  nkW 


{k*-n 


=  2 n  max 
!<(<  + 


0 


0-  i) 


(2) 


Since  this  is  true  for  any  setting  of  f  (provided  that  n  =  ui  +  ^),  hence  we  get 


C(F) 

nr) 


< 


2  n  min 

(0<ll<<n;n=U)+f) 


<  max 

l  <i<ip 


hi' 

!  (fegv i) 

(£+’  -l)  - 


Using  the  Lost  Value  lemma  we  get  the  desired  result. 

□ 

11  For  the  case  k  =  1,  the  uniform  value  density  case,  see  remark  6.3  below. 
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Remark  6.2  Note  that  the  Cascading  Multi-Band  Algorithm  does  not  use  migration,  hence  the 
previous  result  holds  both  whether  migration  is  allowed  or  not. 

Remark  6.3  When  k  =  1,  there  is  no  need  for  the  bands’  structure,  hence  the  central  pool  consists 
of  all  the  processors  (u  =  n  and  4'  —  0)-  This  leads  to  a  competitive  multiplier  of  2  +  1. 

Remark  6.4  When  the  number  of  processor  is  odd,  a  similar  result  can  be  obtained. 

6.1  Setting  4 

In  the  following,  we  will  estimate  the  upper  bound  in  theorem  6.2  by  setting12  if)  =  (hence 

i  4'  \ 

W  ~  nink+l  ~  Ink’’ 

The  bound  in  (2)  above  becomes: 


2 n  max 

1  <i<4> 


ki> 


l 


4  ,  ki'-i 

In*  'T  4- 


1 

2 n(kV  —  1)  max 

l  <i<4> 


k4 


i) 


The  left  hand  side  is  obtained  by  multiplying  both  numerator  and  denominator  by  ( k V’  —  1). 
Define  fa(x)  to  be  o+(^_1).  When  a  >  1,  this  function  is  monotone  increasing  with  x  (x  >  0).  For 

_i_ 

» =  - 1).  a  is  bigger  than13  1.  This  means  that  the  maximum  above  is  attained  at  i  —  4' 

and  the  upper  bound  (equation  2)  is: 


x 

2 n(0  -  1) 


-!)  +  (*-!) 


< 


X 

2 n(0  -  1) 


1  +  (k  -  1) 


X 

=  2 n(k^  -  1) 


(3) 


We  have  just  proved  the  following  lemma: 


Lemma  6.3  The  Cascading  Multi-Band  Algorithm  has  a  competitive  multiplier  of  at  most 

In  k 


1  +  2n(k  V’  —  1),  where  4'  = 


In  k  +  1 

(recall  that  the  complexity  bound  is  bigger  than  2n{k^  —  1)^1  □ 


(4) 


2  In  is  the  natural  logarithm 

i 

s  i  kV -i 

-  Because  4 '  ln  k  is  a  monotone  decreasing  function  of  4 '  tending  to  1  when  4 '  goes  to  infinity. 
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Recall  that  ip(kk'  —  1)  tends  to  In  A;  as  ip  approaches  infinity.  Hence,  when  the  number  of  proces¬ 
sors  tends  to  infinity,  equation  (4)  above  tends  to 

1  +  2  lim  !  '(fc%  -  1)  =  1  +  2lnA'  |  1  Ink  =  1  +  2 (In k  +  1)  (5) 

ip — >oo  m  A;  k  In  A’  k 

Corollary  6.4  The  ratio  between  the  complexity  bound  and  the  algorithmic  guarantee  is  at  most 

1  +  2 n{ki'  -  1) 

^nikt  -  1) 

When  k  is  held  fixed  and  n  tends  to  infinity  this  ratio  tends  to  ^2  +  .  Which,  when  k  tends 

to  infinity  tends  to  a  constant  of  2. 

PROOF. 

Recall  our  complexity  bound  of  2n-^f-^(k^  —  1).  This  bound  tends  to  ^j-lnA  when  n  tends  to 
infinity.  The  limit  of  the  ratio  is  the  ratio  of  the  limits  which  is: 

1  +  2  (In  A  +  l)_A—  1  3 

In  k  k  In  A 

Which  gives  the  desired  result. 

□ 

Remark  6.5  In  the  discussion  above  we  have  chosen  to  ignore  the  fact  that  w  and  ip  must  be 
integers.  We  can  take  care  of  that  by  setting  V  as  the  nearest  integer  to 

6.2  Distributed  vs.  Centralized  Scheduler 

We  discuss  here  architectures  with  large  number  of  processors.  Hence,  it  is  necessary  to  see  which 
portions  of  the  scheduler  are  centralized  and  which  are  distributed.  The  Cascading  Multi-Band 
Algorithm  uses  a  central  scheduler  in  order  to  assign  a  task  to  a  band  (at  task  release  time  and 
LST).  This  means  that  the  centralized  scheduler  has  all  the  information  regarding  tasks  assigned 
to  each  band  and  their  parameters14.  Once  a  task  is  assigned  to  a  band  it  is  left  in  the  hands  of 
the  local  scheduler  (which  basically  employs  Earliest- Deadline- First). 

It  is  desirable  for  reasons  of  fault-tolerant  and  efficiency  ([13])  to  distribute  the  functionality  of 
the  centralized  scheduler  among  the  processors.  This  is  an  interesting  and  important  extension  to 
the  work  presented  here. 

14  Since  all  the  tasks  go  through  the  central  scheduler  this  is  not  difficult  to  do. 
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6.3  The  Cascading  Multi-Band  Algorithm  Scheduling  Overhead 

In  the  previous  sections  we  analyzed  the  performance  of  our  algorithms  in  the  sense  of  their  competi¬ 
tive  multipliers.  In  this  section  we  study  the  cost  of  executing  the  scheduling  algorithms  themselves. 

What  is  the  cost  of  testing  whether  a  newly  arriving  task  can  be  added  to  Q_privileged  containing 
N  tasks  with  out  causing  overload?  This  can  be  done  in  O (log  jV)  operations  using  a  2-3  tree  that 
holds  slack  times  with  sums  of  the  slack  times  from  left  siblings  held  in  interior  nodes.  If  the  task 
is  to  be  added  to  Q -privileged  the  updating  of  the  2-3  trees  involved  takes  also  O(logiV)  time. 

Let  M  be  a  bound  on  the  the  total  number  of  ready  tasks  at  any  given  moment  in  Q-waiting 
and  any  of  the  local  queues. 

When  a  task  is  released  it  may  have  to  be  checked  against  all  bands  (suppose  the  task  cascades 
from  the  highest  band  all  the  way  to  the  lowest)  with  a  total  cost  of  0(nlog  M). 

A  task  in  Q -waiting  awaits  its  LST.  Hence,  Q -waiting  is  a  2-3  tree  organized  according  to  Latest 
Start  Time.  Inserting  and  removing  a  task  from  this  queue  costs  O(logM)  operations. 

A  task  during  its  lifetime  causes  exactly  one  task  release  event  and  at  most  one  LST  interrupt. 
Hence,  the  scheduling  overhead  per  task  is  O(nlogM). 

6.4  Two  Processor  Systems 

In  this  section  we  discuss  the  case  of  uniform  value  density  (i.e.,  k  =  1).  Wang  and  Mao  [3,12] 
showed  a  lower  bound  of  2  (on  the  competitive  multiplier)  and  presented  an  algorithm  that  achieved 
this  bound,  assuming  tasks  have  no  slack  time.  Building  on  this  work  we  have  developed  an  al¬ 
gorithm,  the  Safe-Risky  algorithm  that  achieves  a  competitive  multiplier  of  2  even  when  tasks 
may  have  slack  time  but  migration  is  allowed.  Hence,  the  Safe-Risky  algorithm  is  optimal.  For 
the  “No- Migration”  model,  a  variant  of  the  Safe-Risky  algorithm,  called  the  Safe-Risky-(fixed) , 
achieves  a  competitive  multiplier15  of  3.  We  do  not  know  whether  the  second  algorithm  is  optimal. 
For  reasons  of  space  we  omit  here  the  details  of  these  algorithms  and  their  analysis,  these  can  be 
found  in  appendix  A  at  the  end  of  this  paper. 


lj  Tasks  may  have  slack  time. 
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7  Conclusion 


number  of  importance 

bounds 

processors 

ratio 

complexity 

algorithmic 

comments 

1  : 

any  k  >  1 

tight 

tight  bound  achieved  by  Y)over  [7]. 

2  : 

1 

2 

tight 

tasks  have  no  slack  time  and  may  not 
migrate  between  processors  [3,12]. 

2  : 

1 

2 

3  + 

tasks  may  have  slack  time  but  may  not 
migrate  between  processors. 

2  : 

1 

2 

tight  + 

tasks  may  have  slack  time  and  may  mi¬ 
grate  between  processors. 

n  :  k  >  1 

(k-i  )<kn  !)  + 

1  +  n(k  1':  -  1)  + 
where  V-  =  ” 

general  algorithmic  guarantee  can  be 

seen  in  theorem  6.2 

n»  2  :  k >  1 

(*-il  ln*  1 

i^^hhski 

asymptotic  behavior 

The  above  table  summarizes  the  current  state  of  the  art  of  competitive  real  time  scheduling. 
Here,  n  is  the  number  of  processors  in  the  system;  k  is  the  importance  ratio ,  that  is  the  highest 
possible  value  per  unit  of  computation  time  that  any  task  can  possibly  obtain  (normalizing  the 
lowest  to  1).  The  bounds  are  expressed  in  terms  of  competitive  multipliers.  Results  marked  with  + 
are  part  of  this  paper. 

A  gap  remains  between  the  guarantees  achieved  by  the  Cascading  Multi-Band.  Algorithm  and 
the  complexity  bounds  we  have  proved.  The  algorithmic  guarantee  is  within  a  multiplicative  factor 
of  2  from  the  complexity  bound  for  large  enough  n  and  k  (the  graphs  on  the  next  page  show  that 
the  asymptotic  behavior  is  attained  even  for  small  values  of  n).  When  the  importance  ratio  of  a 
system  ( i.e. ,  k )  is  close  to  1,  a  different  treatment  is  needed.  Some  work  in  this  direction  has  been 
done  by  Bar-Noy  et.  al.  [1], 

It  is  possible  that  a  better  choice  of  ip  will  lead  to  a  better  exact  expression  of  the  algorithmic 
guarantee  for  our  algorithm.  But  it  seems  that,  asymptotically  we  can  not  do  better  without  chang¬ 
ing  our  algorithmic  techniques.  The  reason  is  that  our  basic  block,  the  scheduling  algorithm  for  a 
2-processor  band  concentrates  its  efforts  on  one  processor  at  a  time  (SP);  the  other  processor  RP, 
is  essentially  left  idle.  Hence,  the  Cascading  Multi-Band  Algorithm  automatically  loses  a  factor  of 
2  compared  to  a  clairvoyant  scheduling  algorithm  that  utilizes  all  the  processors  concurrently.  Of 
course,  one  can  suggest  heuristics  that  will  use  a  processor  whenever  possible16,  the  true  challenge 
is  to  show  that  such  a  heuristic  achieves  a  better  worst  case  performance  guarantee.  Another  way 
to  improve  the  algorithmic  guarantee  will  be  to  come  up  with  a  better  algorithm  for  an  m-processor 
band  (for  some  m  >  2). 

16 Heuristic  improvements  can  be  obtained  by  looking  ahead  in  the  waiting  and  privileged  queues  and  by  scheduling 
tasks  on  the  Risky  Processor  before  tasks  arrive  at  their  latest  start  times. 
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An  important  issue  is  how  to  account  for  migration  overhead  in  such  a  system.  For  example,  we 
modeled  NUMA  architectures  by  forbidding  migration  but  that  is  clearly  too  strong  a  restriction. 
Permitting  migration,  but  at  a  cost,  would  have  been  much  more  reasonable. 

There  are  many  other  open  issues  that  must  be  addressed  for  overloaded  systems  in  a  fault- 
tolerant  context.  An  important  issue  is  how  to  reallocate  processors  when  a  failure  occurs.  The 
fact  that  the  algorithms  here  execute  competitively  for  any  number  of  processors  does  not  mean 
they  will  execute  competitively  if  the  number  of  processors  changes  during  the  computation.  Han¬ 
dling  such  changes  in  an  environment  where  failures  can  occur  with  high  frequency  is  still  to  be 
done. 
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A  APPENDIX:  Two  Processor  Systems 


This  section  deals  with  systems  having  only  two  processors.  We  assume  uniform  value  density 
(i.e.,  k  =  1).  With  out  loss  of  generality  we  can  assume  that  the  value  density  is  normalized  to  1. 
We  consider  two  possible  models  of  multiprocessor  systems.  In  the  first  model,  tasks  can  migrate 
cheaply  (and  quickly)  from  one  processor  to  another.  Hence,  if  a  task  started  to  execute  on  one 
processor  it  can  later  continue  on  any  other  processor  (and  migration  takes  no  time).  We  present  a 
scheduling  algorithm  called  the  Safe-Risky  algorithm  for  this  model.  In  the  second  model  (the  fixed 
model),  once  a  task  starts  to  execute  on  one  processor  it  can  not  execute  on  any  other  processor. 
An  on-line  scheduler  can  do  better  when  migration  is  possible. 

A.l  The  Safe-Risky  Algorithm 

In  this  algorithm,  one  processor  is  designated  as  the  Safe  Processor  ( SP)  and  the  other  as  the  Risky 
Processor  (RP).  A  task  that  started  to  execute  on  SP  is  called  “privileged”  because  it  is  guaranteed 
to  complete  (Q -privileged  is  a  queue  containing  these  tasks). 

Ready  tasks  that  are  not  privileged  wait  in  Q_w ailing  until  they  become  privileged  or  they  reach 
their  LST,  at  that  point  the  LST  task  tries  to  be  scheduled  on  RP.  It  will  be  scheduled  if  it  has  a 
bigger  value  than  the  task  currently  executing  on  RP.  A  task  that  started  to  execute  on  RP  can 
be  preempted  and  then  resume  on  SP  (i.e.,  migrate  to  SP).  In  this  version  of  the  algorithm  the 
designation  as  safe  or  risky  processor  is  fixed.  In  the  no-migration  version  (the  Safe-Risky-(fixed)) 
the  processors  may  switch  roles. 


The  following  few  boxes  depict  the  code  of  the  Safe-Risky  algorithm: 


(* 

the  Safe-Risky  algorithm:  a  competitive  scheduling  algorithm  for  two 

processors  systems  in  the  uniform  value  density  case  *) 

1 

Initilzation  : 

(*  Q-waitingand  Q -privileged  are  initialized  to  the  empty  queue.  One 
processor  is  designated  as  the  safe  processor  (SP)  and  the  other  one  is 
designated  as  the  risky  processor  (RP). 

In  the  beginning  both  processors  are  idle.  *) 

2 

Q -privileged  := 

3 

Q -waiting  :=  <p: 

4  loop  : 

5 

Task  Release  :  (*  T  is  released  *) 

6 

if  (SP  is  idle  )  then 

7 

schedule  T  on  SP; 

8 

else  if  ( T  has  earlier  deadline  than  the  current  task  on  SP 

and  can  be  scheduled  to  completion  with  the  current  task 
as  well  as  with  all  other  privileged  tasks)  then 

9 

preempt  current  task; 

10 

add  the  current  task  to  Q ^privileged', 

11 

schedule  T  on  SP; 

else  (*SP  is  not  idle  and  T  can  not  be  scheduled  on  SP  *) 

12 

add  T  to  Q -waiting: 

endif 

13 

end  ( *task  release  *) 

14 

LST: 

(*  T  reached  its  LST;  LST  denotes  the  latest  start  time  of  T,  i.e.,  the 
moment  when  the  computation  time  remaining  for  T  equals  the  time 
to  its  deadline 

Note  that  scheduling  T  on  SP  will  cause  overload  (with  all  the  other 
privileged  tasks)  *) 

15 

if  ( RP  is  idle)  then 

16 

schedule  T  on  RP; 

17 

else  ( *RP  is  not  idle  and  T  can  not  be  scheduled  on  SP  *) 

18 

let  Trp  be  the  current  task  on  RP; 

19 

if  (  va lue(T)  >  va lue(Tpp))  then 

20 

abandon  Tpp: 

21 

schedule  T  on  RP; 

22 

else 

23 

abandon  T; 

24 

endif  (* comparing  values  *) 

25 

endif  (*RP  is  not  idle  *) 

26 

end  ( *LST  *) 
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27  Task  Completion  : 

(*  on  SP;  There  is  no  special  event  when  a  task  completes  on  RP  *) 

28  let  T  be  the  task  with  earliest  deadline  in  Q_waiting: 

29  if  (T  has  earlier  deadline  than  the  current  task  on  SP 

and  can  be  scheduled  with  all  other  privileged  tasks)  then 

30  remove  T  from  Q -waiting] 

31  add  T  to  Q -privileged ; 

32  (*  which  amounts  to  scheduling  T  on  SP  *) 

33  endif  (*T  can  not  be  added  to  Q -privileged  *) 

34  if  (all  the  privileged  tasks  can  complete  with  the 

task  currently  executing  RP)  then 

35  preempt  the  current  task  on  RP; 

36  add  it  to  Q -privileged] 

^  (*  this  task  will  be  scheduled  on  SP  by  the  following  piece  of  code.  This 

is  the  only  place  where  migration  is  used  *) 

38  endif 

39  schedule  on  SP  the  task  with  earliest  deadline  in  Q-privileged  (if  any); 

40  end  ( *task  completion  *) 

41  end  {loop} 

Remark  A.l  Note  an  important  difference  between  the  above  algorithm  (and  for  that  matter  also 
Y)Over  anj  ^ie  version  of  the  Cascading  Multi-Band  Algorithm  presented  earlier.  In  the  task 
release  routine  a  newly  arrived  tasks  will  be  added  to  Q-privileged  only  if  its  deadline  is  earlier  than 
the  currently  executing  task  and  also,  it  can  be  added  to  Q -privileged  without  creating  overload 
(we  call  that  a  local  schedulabilty  test.  This  stands  in  contrast  to  the  global  schedulabilty  task  used 
by  the  Cascading  Multi-Band  Algorithm.  That  is:  a  new  task  enters  Q-privileged  if  and  only  if  it 
does  not  create  overload  irrespectively  of  its  deadline. 

A. 2  The  Competitive  Multiplier  of  the  Safe-Risky  algorithm 

In  this  section  we  show  that  the  Cascading  Multi-Band  Algorithm  has  a  competitive  multiplier  of 
2.  Our  approach  is  to  partition  the  execution  into  disjoint  intervals,  the  Cascading  Multi-Band 
Algorithm  gets  at  least  half  the  optimal  value  in  each  interval.  This  is  proved  by  means  of  “cover¬ 
ing”.  We  show  that  all  intervals  are  covered;  if  a  period  is  covered  then  the  Cascading  Multi-Band 
Algorithm  uses  at  least  one  processor  effectively  during  all  that  period.  The  clairvoyant  scheduler 
could  gain  a  factor  of  two  by  utilizing  both  processors. 

Definition  A. 2  earliness,  latest_affected,  cover_end 

iii 


By  definition  the  tasks  of  Q-privileged  constitute  an  underloaded  subsystem  (i.e,  all  these  tasks 
can  complete  on  one  processor). 

Now,  let  us  consider  one  additional  task,  1  .  which  has  a  deadline  earlier  than  all  the  tasks  in 
Q -privileged.  Suppose  that  the  system  comprised  of  Q -privileged  plus  T  is  overloaded.  We  want 
to  know  which  of  the  tasks  of  Q-privileged  conflict  with  T. 

Compute  the  earliness  of  all  the  tasks  in  Q-privileged.  That  is,  suppose  the  tasks  are  scheduled 
according  to  earliest- deadline -first  scheduling  algorithm,  the  earliness  of  a  task  is  how  much  before 
its  deadline  it  completes  (i.e,  its  deadline  minus  its  completion  time).  Any  task  with  earliness 
smaller  than  T’s  computation  time  in  Q-privileged  prevents  T  from  being  added  to  Q-privileged. 

Define  latest _affected(T)  to  be  the  latest  deadline  of  any  such  task  (i.e,  a  tasks  whose  earliness 
is  smaller  than  T’s  computation  time)  and  define  cover -end(T)  to  be, 


□ 


cover  -end(T)  =  max{  deadline  (T),  latest -affect  ed(T)} 


Definition  A. 3  INTERVAL 

An  interval  starts  when  a  task  is  released  to  an  idle  system  (the  system  becomes  non-idle)  and 
ends  when  the  system  becomes  idle  again  (both  processors  are  idle). 

□ 


In  the  following,  we  are  going  to  analyze  the  Safe-Risky  algorithm  interval  by  interval.  For 
the  sake  of  notational  convenience  let  us  assume  that  there  was  only  one  interval  (call  it  I)  and  it 
started  at  time  0. 


Lemma  A.l  The  value  obtained  by  the  Safe-Risky  algorithm  from  tasks  executed  on  SP  during 
an  interval  is  at  least  the  interval  length  (duration). 

PROOF. 

From  the  beginning  of  the  interval  to  its  end  SP  is  never  idle  (when  SP  is  idle  so  is  RP  which  means 
that  the  interval  ends).  A  task  that  was  scheduled  on  SP  is  guaranteed  to  complete.  We  conclude 
that  at  any  given  moment  at  least  one  processor  is  working  on  a  task  that  will  eventually  complete. 
This  proves  the  lemma. 

□ 

Notation  A.4  C(I),  V(I),  VSP(I),  VRP(I) 

For  an  interval  /,  let  C(1 )  and  V(I)  denote  the  value  obtained  by  the  clairvoyant  algorithm 
and  the  Safe-Risky  algorithm  (respectively)  from  tasks  released  during  /.  Similarly,  let  VSP 
and  VRP(I)  denote  the  portion  of  V(I)  obtained  by  the  Safe-Risky  algorithm  on  SP  and  RP 
respectively. 

□ 


IV 


Definition  A. 5  conflicted  task,  max_cover 

We  say  that  T  is  a  conflicted  task  if  it  conflicted  with  the  tasks  of  Q_privileged  (when  it  was 
released  or  when  considered  to  execution  at  its  latest  start  time)  and  hence  was  diverted  to  RP 
(and  later  was  either  scheduled  to  execution  or  was  abandoned) 

Define  max  mover  as, 

max  mover  =  max  mover  (I)  ^  max  covermnd(T ) 

T  is  a  conflicted  task 

If  there  is  no  conflicted  task  this  max  equals  0. 

□ 

Lemma  A. 2  For  every  task  T ,  ifT  is  a  conflicted  task  (i.e,  was  diverted  to  RP  by  the  Safe-Risky 
algorithm)  then1' ,  the  total  value  obtained  by  the  Safe-Risky  algorithm  is  at  least  covermnd(T) 
PROOF. 

Recall  that, 


cover mnd(T)  =  max{  deadline  (T),  latest mffected(T)} 


•  First,  let  us  show  that  V(I)  >  deadiine(T). 

T  is  diverted  to  RP  at  its  latest  start  time  which  is  at  deadline(T )  —  computation_time(T). 
This  means  that  the  interval  length  is  at  least  deadline(T)  —  computationJime(T )  (recall  that 
the  interval  starts  at  time  0).  Lemma  A.l  says  that  the  value  obtained  by  the  Safe-Risky 
algorithm  on  SP  is  at  least  deadline(T)  —  computation_time(T). 

If  T  does  not  complete,  it  can  be  abandoned  only  in  favor  of  a  bigger  valued  task  (scheduled 
on  RP),  hence  the  value  obtained  on  RP  is  at  least  value(T ).  Hence, 


V(I)  =  VSP(I)  +  VRP(I) 

>  (deadiine(T)  —  computation  J,ime(T))  +  value{T ) 

=  ( deadline(T )  —  computation  J,ime(T))  +  computation_time(T ) 

=  deadline  (T) 

•  Now,  we  are  going  to  show  that  V(I)  >  latest mffected(T). 

By  definition,  latest _affected(T)  is  the  deadline  of  some  task  Taffecte<i.  Let  factual  be  the 
time  when  Tajfecte(i  completed  (according  to  the  Safe-Risky  algorithm),  factual  can  not  be 
any  earlier  than  the  estimated  completion  time  of  T  computed  when  T  was  considered  for 
execution  (the  actual  time  can  be,  in  fact,  later  than  the  estimated  time  if  some  tasks  were 

17 Recall  that  the  interval  starts  at  time  0. 


V 


later  added  to  Q -privileged).  The  length  of  the  interval  is  at  least  l actual-  Lemma  A.l  shows 
that  the  value  obtained  on  SP  is  at  least  this  length. 

As  mentioned  before,  the  value  obtained  by  the  Safe-Risky  algorithm  on  RP  is  at  least 
value(T).  Hence, 


V(T)  =  VSP(I)  +  VRP(I) 

>  tactual  +  computationJime(T) 

>  deadline  (Ta  fleeted) 

=  latest  _affected(T) 

The  second  inequality  restates  the  definition  of  latest_affected(T). 


□ 

Corollary  A. 3  The  value  obtained  by  the  Safe-Risky  algorithm  during  an  interval  is  at  least 

ma x{the  interval  length,  maxjcover} 

□ 

Definition  A. 6  covered,  uncovered,  atomicjnterval 

For  any  interval  /,  let  uncovered  =  uncovered(I)  be  the  set  of  tasks  (released  in  /)  with  dead¬ 
lines  after  max_cover( I) .  Denote  the  remaining  tasks  by  covered  =  covered! I).  I  is  called  an 
atoniicJnterval  if  uncovered(I)  =  0  (the  empty  set). 

□ 

Lemma  A. 4  For  an  atomic Jnterval  I,  the  Safe-Risky  algorithm  achieves  a  competitive  factor  of 
2.  That  is, 


C(I)  <  2 V(T) 


PROOF. 

All  tasks  have  deadlines  before  max_cover.  So  the  best  the  clairvoyant  algorithm  can  do  is  to  sched¬ 
ule  tasks  on  both  processors  from  time  0  to  maxmover  to  get  a  value  of  2  •  maxjcover.  Corollary  A. 3 
ensure  that  the  Safe-Risky  algorithm  gets  a  value  of  maxjcover  at  least.  □ 

Lemma  A. 5  For  every  non  atomic  Jnterval,  I, 

•  All  the  tasks  of  uncovered  are  executed  to  completion  on  SP. 

•  When  a  task  of  uncovered  is  executing  on  SP,  RP  is  idle. 
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•  Suppose  the  tasks  of  uncovered  were  never  released  (i.e,  only  the  tasks  of  covered  were  re¬ 
leased).  Then,  the  tasks  of  covered  will  be  scheduled  (on  SP  or  RP),  preempted,  and  completed 
or  abandoned  at  exactly  the  same  times  as  if  the  tasks  of  both  covered  and  uncovered  were 
released. 

•  Removing  uncovered  from  the  original  set  of  tasks  will  break  the  interval  into  one  or  more 
sub-intervals  separated  by  idle  time. 

PROOF. 

•  A  task  T  is  diverted  to  RP  ( and  possibly  abandoned)  only  if  it  is  involved  in  a  conflict  with  the 
tasks  of  Q ^privileged.  A  task  T  of  uncovered  can  not  be  involved  in  any  conflict  (otherwise, 
max-cover  will  be  at  least  deadline(T )  which  contradicts  the  fact  that  T  is  in  uncovered).  We 
conclude  that  T  was  scheduled  on  SP  (hence  also  completed). 

•  Suppose  that  Trp  is  executing  on  RP.  Trp  was  scheduled  at  its  latest  start  time.  Let  T  be 
the  first  task  of  uncovered  to  be  executed  (on  SP)  concurrently  with  Trp.  Note  that  all  the 
tasks  in  Q -privileged  (at  that  point)  must  be  of  uncovered  because  these  tasks  are  scheduled 
on  SP  according  to  deadline  order  and  the  tasks  of  covered  have  earlier  deadlines. 

When  was  T  scheduled?  It  could  not  be  before  LST(Trp)  because  this  means  that  (at  its 
latest  start  time)  Trp  had  a  conflict  with  tasks  of  uncovered.  So,  T  must  have  been  scheduled 
after  LST{Trp)  as  a  consequence  of  a  task  completion  event18.  But,  if  this  is  the  case,  all 
the  tasks  of  Q -privileged  could  be  scheduled  on  one  processor  along  with  Trp.  Hence  the 
Safe-Risky  algorithm  (in  the  Task- Completion  routine)  would  have  scheduled  Trp  and  then 
all  the  privileged  tasks  on  SP,  a  contradiction. 

•  The  execution  of  a  task  of  covered  on  SP  can  be  delayed  or  interrupted  (i.e,  preempted)  only 
by  tasks  of  covered  (because  all  the  tasks  of  uncovered  have  later  deadlines).  Hence,  the 
execution  of  covered  tasks  on  SP  is  not  affected  by  uncovered  tasks. 

A  task  of  covered  can  not  be  diverted  to  RP  as  a  consequence  of  a  conflict  with  a  task  of 
uncovered  (because  the  tasks  of  uncovered  had  no  conflicts).  Also,  no  task  of  uncovered  will 
be  scheduled  on  RP.  This  implies  that  the  execution  of  covered  tasks  on  RP  is  not  at  all 
affected  by  uncovered  tasks. 

•  We  saw  above  that  removing  the  tasks  of  uncovered  will  not  change  the  execution  history 
of  all  other  tasks.  Also,  when  a  task  of  uncovered  is  executing  on  SP,  RP  is  idle.  Hence, 
periods  in  which  only  tasks  of  uncovered  execute  in  the  execution  of  (coveredU  uncovered ) 
will  correspond  to  idle  periods  in  the  execution  of  covered. 

□ 

18 It  can  not  be  a  consequence  of  a  task  release  event  because  a  task  of  uncovered  can  preempt  only  tasks  of 
uncovered,  in  which  case  the  task  that  precede  T,  on  SP,  is  the  first  task  of  uncovered  to  coincide  with  Trp. 
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Lemma  A. 6  For  any  interval  I,  the  Safe-Risky  algorithm  guarantees  that  the  competitive  multi¬ 
plier  can  not  be  greater  than  2.  I.e, 


C(I)  <  2 V(I) 

PROOF. 

The  proof  is  by  induction  on  the  structure  of  the  interval. 

Basis:  If  the  interval  is  atomic  Jnterval  then  lemma  A. 4  ensures  that  the  induction  hypothesis 
holds. 

Induction  step:  Suppose  the  induction  hypothesis  is  known  for  all  the  sub-intervals  /l5  J2,  •  ■  ■ ,  Im 
of  I  (after  removing  the  tasks  of  tmcovered(I)).  By  the  induction  hypothesis,  C'(Ij)  <  2 1  ( If)  for 
all  j. 

But19, 


C(I)  =  C(h  U  /2  U  ■  ■  ■  U  /m  U  uncovered(I)) 

<  C(I i)  +  C(/2)  + - h  C(Im )  U  C(uncovered(I )) 

<  2F(/i)  +  2 V(I2)  •  •  ■  +  2 V(Im)  +  C(  uncovered(I)) 

=  2V(I1)  +  2V(I2)  •  •  •  +  2 V{Irn)  +  value(uncovered(I )) 

Lemma  A. 5  shows  that  the  tasks  of  uncovered(I)  do  not  affect  the  scheduling  of  tasks  in  the 
sub-intervals  (by  the  Safe-Risky  algorithm).  Hence, 

V (/)  =  V(Ii)  +  F(/2)  ■  •  ■  +  V(Im)  +  value(uncovered(I )) 

This  shows  that  the  induction  hypothesis  holds  for  /,  hence  the  theorem  is  proved. 

□ 

Theorem  A. 7  For  a  system  of  uniform  value  density  and  2  processors,  the  Safe-Risky  algorithm 
achieves  the  best  possible  competitive  multiplier  of  2  (when  migration  of  tasks  is  permitted). 
PROOF. 

The  intervals  partition  the  entire  set  of  tasks  T  into  disjoint  subsets  Ti, T2,  •  ■  •. 

C(T)  <  c(ri)  +  c(r2)  +  •  •  • 

<  2(V(r1)  +  F(r2)  +  ■  •  ■) 

=  21  (1) 

The  last  equahty  is  due  the  fact  that  tasks  of  one  interval  do  not  influence  the  scheduling  decisions 
(of  the  Safe-Risky  algorithm)  in  another  interval.  This  might  not  be  the  case  for  the  clairvoyant 
algorithm,  that  is  why  the  first  inequality  is  not  an  equality. 

□ 

19 Recall  that  all  the  tasks  of  uncovered(I)  are  scheduled  to  completion  by  the  Safe-Risky  algorithm.  Hence, 
C'( uncovered  (/))  =  value  ( uncovered  (I))  =  V(  uncovered  (/)). 
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A. 3  When  Task  Migration  is  Not  Allowed 

By  task  migration,  we  mean  moving  a  task  that  has  already  partially  executed  from  one  processor 
to  another.  In  the  case  that  task  migration  is  not  allowed,  we  can  use  a  simple  variant  of  the 
Safe-Risky  algorithm  to  get  a  competitive  multiplier  of  3  for  tasks  with  slack  time20.  We  call  this 
variant  the  Safe-Risky-(fixed). 

A. 3.1  The  Safe-Risky-(fixed)  Algorithm 


The  only  place  that  the  Safe-Risky  algorithm  used  the  possibility  of  task  migration  is  in  the  task 
completion  routine.  For  that  reason,  the  Safe-Risky-(fixed)  differs  from  the  Safe-Risky  algorithm 
only  in  this  routine. 


42 

Task  Completion  : 

(*  on  SP;  There  is  no  special  event  when  a  task  completes  on  RP  *) 

43 

let  T  be  the  task  with  earliest  deadline  in  Q_w ailing: 

44 

if  (  T  has  earlier  deadline  than  the  current  task  on  SP 

and  can  be  scheduled  with  all  other  privileged  tasks)  then 

45 

remove  T  from  Q -waiting: 

46 

add  T  to  Q -privileged] 

47 

(*  which  amounts  to  scheduling  T  on  SP  *) 

48 

endif  (*T  can  not  be  added  to  Q -privileged  *) 

49 

schedule  on  SP  the  privileged  task  with  earliest  deadline  (if  any); 

50 

if  (SP  is  idle)  then 

51 

switch  roles  between  the  SP  and  RP  processors; 

52 

endif  ( *SP  is  idle  *) 

53 

end  (*task  completion  *) 

A. 3. 2  Analysis 

We  define  latest  ^affected  and  all  the  other  definitions  as  before.  Observe  that  lemmas  A.l  and  A. 2 
hold  for  the  new  algorithm  as  well. 

Recall,  that  F  denotes  the  set  of  tasks  abandoned  by  the  on-line  algorithm 
Lemma  A. 8  For  every  interval,  I, 

C(F)  <  21  (/) 

20When  tasks  have  no  slack  time  an  optimal  algorithm  with  competitive  multiplier  of  2  is  known  [12,3]. 
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PROOF. 


All  tasks  that  were  abandoned  by  the  Sa,fe-Risky-(fixed)  were  diverted  to  RP.  Let  d  be  the  maxi¬ 
mal  deadline  among  all  tasks  of  F  21 .  The  best  the  clairvoyant  can  do,  with  tasks  of  F.  is  to  schedule 
both  processors  continuously  to  get  a  value  of  2d.  Lemma  A. 2  shows  that  the  Sa,fe-Risky-(fixed) 
gets  a  value  of  at  least  d. 

□ 

Theorem  A. 9  For  any  interval  I  the  competitive  multiplier  of  the  Sa,fe-Risky-(tixed)  algorithm 
is  less  than  or  equal  to  3.  I.e, 


C(I)  <  3 V{I) 


PROOF. 

This  follows  from  the  Lost  Value  Lemma  (4.1)  and  lemma  A. 8. 

□ 


21If  there  is  no  such  maximal  deadline.  Then  the  supremum  is  either  finite  or  infinity.  If  it  is  finite  a  similar  proof 
will  work,  if  it  is  infinity,  then  the  interval  is  infinitely  long,  in  which  case  the  value  obtained  by  the  on-line  algorithm 
is  infinity. 
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B  APPENDIX:  The  Complexity  Bound;  Few  Lemmas 


In  section  3  we  proved  the  complexity  result  in  the  special  case  that  only  one  value  density  level 
is  missing  from  the  on-line  schedule.  But  what  will  happen  if  more  than  one  level  is  missing?  In 
this  appendix,  we  will  show  that  this  can  not  benefit  the  on-line  scheduler  Hence,  the  complexity 
bound  holds  in  the  general  case. 

Definition  B.l  PRESENT(t),  $(«),  select(t) 

For  any  time  /,  look  at  the  tasks  that  are  currently  being  executed  by  the  on-line  scheduler.  For 
each  density  level  i,  0  >  i  >  n: 

Define  present(i)  to  be  the  number  of  tasks  from  level  i  that  are  executing  at  time  t. 

Define  the  following  potential  function 

i 

$(  i)  =  (i  +  1)  —  ^2  present(j) 
j=o 

Define  select(t )  to  be  the  minimal  index  i  for  which  d>(i)  >  1  (Note  that,  d>(0)  <  1  and  <J>(n)  >  1). 

□ 

We  will  start  by  proving  two  properties  of  $  and  select. 

Lemma  B.l  For  all  t, 

1.  present(select(t ))  =  0  (i.e.,  No  task  of  level  select[t)  is  executing  at  time  t). 

2.  $>(select(t))  =  1  (i.e,  equals  one  rather  then  greater  than  1). 

PROOF. 

There  are  two  cases:  select(t )  =  0  and  select(t)  >  0. 

•  select(t )  =  0. 

From  the  definition  of  $,  d>(0)  <  1.  But,  select(t)  =  0  hence  $(0)  >  1.  We  conclude  that 
$(0)  =  1  which  implies  that  present^)  =  0. 

•  select(t)  >  0. 

Let  select(t)  —  i  +  1  (for  i  >  0).  Note  that  by  definition, 

$(i  -f  1)  =  $(i)  +  1  -  present{  i  +  1)  (6) 

Assume,  in  order  to  get  a  contradiction,  that  present(i  +  1)  >  1  then  $  (0  >  $(i+  1)  >  1. 
Therefore,  $  (0  >  1-  This  is  a  contradiction  to  the  fact  that  select(t)  is  the  minimal  index  i 
with  $(i)  >  1.  We  conclude  that  present(i  +  1)  =  0. 

Substituting  present(i  +  1)  =  0  in  (6)  above  gives  1  <  +  1)  =  $(i)  +  1.  Since  d>  (0  <  0 

(otherwise  i  +  1  is  not  minimal),  $(«  +  1)  =  1. 
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□ 


Lemma  B.2  At  time  t,  let  1\.  I z-  ■  ■  -Tm  be  the  tasks  of  levels  0  to  select{t)  —  1  scheduled  by  the 
on-line  scheduler.  Then,  there  is  a  one  to  one  correspondence  f  from  these  tasks  to  all  the  density 
levels  from  0  to  select(t)  —  1  so  that  f{T )  is  greater  than  or  equal  to  the  value  density  of  T.  In 
other  words,  we  can  promote  all  tasks  with  level  density  below  select{t )  to  a  higher  (or  equal)  value 
density  (but  still  below  select(i)  —  1)  so  that  after  the  promotion,  only  one  task  corresponds  to  any 
given  value  density  level. 

PROOF. 

Assume  that  the  tasks  Tj.,  T-j,  ■  •  • ,  Tm  are  ordered  by  non- decreasing  order  of  value  densities.  Define 
/  as  follows, 

f(Tr)  =  r-  1 

(That  is,  f(Tr )  is  the  position  in  the  list  starting  from  zero) 

We  know  that, 


1  =  (f>(select(t )) 

select{t)  —  1 

=  select(t )  +  1  —  ^2  present(j) 

j=o 

=  select(i )  +  1  —  m 

Therefore,  select(t)  =  m.  Hence,  /  is  a  one  to  one  function  onto  [0 ,.(select(t)  —  1)]. 

We  still  have  to  show  that  for  every  Tr,  f{Tr )  is  at  least  the  value  density  level  of  T.r.  If  the  level 
of  Tr  is  0  then  there  is  nothing  to  prove  since  /  >  0,  so  assume  that  Tf  s  level  is  *  +  1  for  some  i. 


0  >  $(*)  =  i  +  1  -  Y]  present  (j ) 

j=o 

Therefore,  )_0  present^ )  >  i  +  1.  Hence,  there  are  at  least  i  +  1  tasks  with  value  densities 
below  Tf  s  value  density.  This  means  that  Tr' s  position  in  the  ordered  list  is  not  before  the  «  +  2’th 
place,  so  f(TT )  >  (  i  +  2)  —  1  >  i  +  1. 

□ 

Now  we  are  ready  to  state  and  prove  the  version  of  lemma  3.1  for  the  general  case  (i.e.,  when 
more  than  one  level  is  missing). 

Lemma  B.3  Suppose  the  on-line  scheduler  gets  partial  value  for  tasks  that  were  executed  but  not 
to  completion.  In  particular,  for  a  task  that  executed  for  a  (0  <  a  <  1)  of  its  computation  time  the 
scheduler  will  get  a  times  the  task's  value. 
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Then,  the  effective  value  density  (i.e,  the  value  achieved  divided  by  the  period’s  length)  obtained 
by  the  clairvoyant  scheduler  between  ti  and  is  at  least  ^  n(k'«  —  1)  times  bigger  than  the 
effective  value  density  obtained  by  the  on-line  scheduler  for  the  same  period. 

PROOF. 

Suppose  Tu-Za,  •  ■  -  Tm  are  the  tasks  scheduled  at  ti  with  value  densities  i\  <  *2  <  •  •  -im  <  selectfti). 
Denote  selectftfj  by  i0. 

The  clairvoyant  algorithm  schedules  n  tasks  of  level  i0 ,  there  are  no  more  tasks  releases  until  the 
end  of  ?'o’s  period.  Hence  the  ratio  between  the  effective  value  densities  is  at  least: 


nk' 


,  11.  ,12.  ,1m.  xx 

(k  «  +  k  «  +•••  +  «"  +  ( n  —  m)e) 

But,  if  we  replace  the  value  density  of  a  task  T  by  f{T)  (/  as  in  lemma  B.2)  then  the  denominator 
does  not  decrease  hence  the  ratio  does  not  increase. 


nk~ 


/ ,  11  ,  12.  ,  im.  x  x 

[k  n  +K"  +••■  +  «"  +  (n  —  mjcj 


> 


> 


nk  ° 


,  ,  tii  1  1  ,  J  /(’m)  .  . 

(k  «  T  k  "  +---  +  K  «  +  (n  —  mjej 


nk' 


+  k«  +  ■  ■  ■  +  k»  +  (n  —  m)e) 


We  saw,  in  lemma  3.1  above,  that  the  above  ratio  is  not  smaller  than 


- 1) 


xiii 


and  the  lemma  is  proved. 

□ 


